tsp XhoI Start codon

GTTTTAAATAC GCTCGAG GTTTTAAAT ATG TCT GTT GCC TTG TTA TGG GTT GTT TCT CCT TGT GAC

Met Ser Val Ala Leu Leu Trp Val Val Ser Pro Cys Asp



Transit peptide of phytoene synthase

GTC TCA AAT GGG ACA AGT TTC ATG GAA TCA GTC CGG GAG GGA AAC CGT 
Val Ser Asn Gly Thr Ser Phe Met Glu Ser Val Arg Glu Gly Asn Arg




Figure 1 
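The codon-by-codon translation shown beneath the sequence can be checked mechanically. The following is a minimal sketch, assuming the standard genetic code; the names `CODON_TABLE`, `THREE_LETTER` and `translate` are illustrative and do not appear in the source. It uses the second sequence line of the figure.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# One-letter to three-letter names, matching the notation used in the figures.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Stop",
}


def translate(dna):
    """Translate a DNA string (whitespace ignored) in reading frame 1.

    Raises KeyError on codons containing characters other than A, C, G, T.
    A trailing partial codon is ignored.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    return [THREE_LETTER[CODON_TABLE[c]] for c in codons]


# Second sequence line of Figure 1 (phytoene synthase transit peptide).
orf = "GTC TCA AAT GGG ACA AGT TTC ATG GAA TCA GTC CGG GAG GGA AAC CGT"
print(" ".join(translate(orf)))
# Val Ser Asn Gly Thr Ser Phe Met Glu Ser Val Arg Glu Gly Asn Arg
```

The same helper can be applied to the other codon/amino-acid pairs in the drawings to spot positions where the printed translation and the printed codon disagree.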
BamHI




pBR322 



Figure 2 
tsp XhoI Start codon

GTTTTAAATACGCTCGAG.TGCAGC ATG GAA ACC CTT CTA AAG CCT TTT CCA TCT CCT TTA CTT

Met Glu Thr Leu Leu Lys Pro Phe Pro Ser Pro Leu Leu 

Transit peptide of capsanthin-capsorubin synthase

TCC ATT CCT ACT CCT AAC ATG TAT AGT TTC AAA CAC AAC TTC ACT TTT

Ser Ile Pro Thr Pro Asn Met Tyr Ser Phe Lys His Asn Ser Thr Phe




Figure 3 






Asp XhoI Start codon

GTTTfMATACGS£SMCC ATG GCT TCC TCA GTT CTT TCC TCT GCA GCA GTT GCC ACC CGC

Met Ala Ser Ser Val Leu Ser Ser Ala Ala Val Ala Thr Arg 



RUBISCO SSU Chloroplast Transit Peptide (N. tabacum)

AGC AAT GTT GCT CAA GCT AAC ATG GTT GCA CCT TTC ACT GGC CTT
Ser Asn Val Ala Gln Ala Asn Met Val Ala Pro Phe Thr Gly Leu



BamHI 



Phytoene Synthase
XhoI




TMV RNA (+6734) 
KpnI




TTU51 CTP CrtB 

(11.6 kb) 




pBR322 



TMV RNA (+1)



126K 



68 nt 



SP6 promoter 



Figure 4 






13 PaeR7I
13 XhoI
190 Sph I




5702 Bbe I
5702 EheI
5702 Kas I
5702 NarI
5968 Eag I



Figure 5 
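The maps list restriction sites by coordinate. As a rough illustration of how such a coordinate can be derived from a sequence, the sketch below scans for a recognition sequence and reports 1-based positions of its first base. The function name `find_sites` and the choice of reporting the first base of the recognition sequence (rather than the cut position) are assumptions made here; the drawings do not state which convention they use. The example sequence is the leader shown in Figure 1, and CTCGAG is the XhoI recognition sequence.

```python
def find_sites(sequence, recognition):
    """Return 1-based start positions of every occurrence of `recognition`.

    Whitespace in `sequence` is ignored and matching is case-insensitive.
    Overlapping occurrences are reported. Only the given strand is scanned,
    which is sufficient for palindromic sites such as CTCGAG.
    """
    seq = "".join(sequence.split()).upper()
    site = recognition.upper()
    positions = []
    start = seq.find(site)
    while start != -1:
        positions.append(start + 1)  # convert 0-based index to 1-based
        start = seq.find(site, start + 1)
    return positions


# Leader of Figure 1 up to and including the start codon.
leader = "GTTTTAAATAC GCTCGAG GTTTTAAAT ATG"
print(find_sites(leader, "CTCGAG"))  # XhoI recognition sequence
# [13]
```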
ScaI 10414




Sse8387I 5750



Figure 6 
225 DraIII



653 Ecl136II
653 SacI
658 BstXI
661 DsaI
661 SacII



Xmn I 4130 
ScaI 4013



669 Not I 

670 Eag I



707 



BsaI 3602
AhdI 3530



AlwN I 3053 




XbaI
Spe I
EcoRI
718 AvrII
8 Sty I
783 BsmI

815 SexAI

816 XcmI



958 BstEII



1144 BsgI
1188 BglII



1335 PflM 



1551 Bsu36I



BspLU11 I 2642
AflIII 2642



KpnI 2244
Acc65I 2244

Bsp120I 2238

ApaI 2238 2199 Sph I



2031 
2139 Sse8387 I 



1656 Stu
1726 Afl II
1757 NruI
1849 AatI
1916 BbsI
Bcl I



EcoO109 I 2237



2208 Hind III



XhoI 2229 2214 BspD I
PaeR7I 2229 2214 Cla I
Sal I 2223
Hinc II 2223



Figure 7 
Pac I 5757 5455 Apa I

5455 Bsp120 I



Figure 8 
EagI 10851

Nru I 10818 




Figure 9 
METAlallePrcXSluGluPheAspIl^^

TCCTTGAAAGTTGGTCTTATCGAAGCAGGTC 180 

TTGGACTC<»AG&r3^CK^ 270 

GGTC^STGCTTCTICTATCAACT 360 
ClyGiyGlySerSerlleAsnPheMEIMETryrThrArgGly 

AspLeuI^uProLeuMETLysLysTfarGlu^^ 

TCCT^CGGTAACTACaCCTACC 540 
SerPheGlyAsnTyzl'hrTyrProValO^ 

"TCPSGriTACCGCTCAl^ 630 
LeuValTteAI&HisGlyMaGiuHisT:^ 

TCESOamSftGJUUVE^^ 720 
3erThjct^ArgAsnHi«A8pA«nIie^^ 

AAAAC033TOy*AGCAAGC^^ 810 

Lyaaa^alProSfltrLysPxrel^euAanProLysLya 

IleSerSerProLeuValLeuGlr^ 

GGTKsfTGGTAGAAACTTCCAGGACCACT^ 990 
GlyValGlyArgftsrj?^ 

GGTSAaCTGAOATrCAAARGAGACT 1080 

GlyAspAiaGIuIleGlriLysArgValPheA^ 

AAGMCAGACCMCTCCT 1170 
LysIlea^grroTb^ProGluGia'STAlaGIri-ETAspGluSerPheGl 

PraValMEraisTyrSerllelleAl^ 

G^CTaCO^TTCrcrrAGAGGTTC 1350 
GluIyrProPhaSerArgGlySerlleHist 

AGAGACSTGGCTC^ATQST'TT^ 1440 
Argiusqpl^AlaPrcaa^^ 

CACCCIOTGTTOCCATACTC^^ l S3 g 
HiaProI>uPhaProTyr3erSerGiuAl*Arg^ 

<5CTCGTCTTGCACACGGTTCTO3^^ 15:o 
MaGlyLeuMaHisGlySerf^rpT^ 

CACCCAGACATCGAGTACGATCAGGAQSATGA^ mc 
HisPra&spIieGluTyrAspGluGLuA^ 

AOTXHTO^33G^ 19C0 
ThrCysSerneGlyPrs^gGluGiyS^^^ 

'ITGAAGGTTCGTGACTrGTCTGTTK?rc 1350 
LeuI>ysValGlyAspteuSerVal^ 

CTCGTpGTCAAGACTTAGGATACTCTOSTCftGGCCT^ 1380 
^va.GlyGluAspLeiiGlyTyrSe^ 

GCTAGATTCTAA 

AlaArgPheSTP

Figure 10 
Figure 11
tsp XhoI Start codon

GTTTm&ATACGCECfiMATCAA'TCCATCTCCGAAGTSTGTCTGCAGC ATG CAG GTG CTG AAC ACC ATG* 

Met Gln Val Leu Asn Thr Met



Rice α-amylase signal peptide
GTG AAC AAA CAC TTC TTG TCC CTT TCG GTC CTC ATC GTC CTC ATC GTC 

Val Asn Lys His Phe Leu Ser Leu Ser Val Leu Ile Val Leu Leu Gly




Figure 12 
tsp XhoI Start codon

GTTT TAA ATACGg TCGAGG ATG ATC AGA TTC TTA GTC CTC TCT TTG CTA ATI* CTC ACC CTC TTC 

Met Ile Arg Phe Leu Val Leu Ser Leu Leu Ile Leu Thr Leu Phe



Signal peptide -1 +1 Mature α-trichosanthin

CTA ACA ACT CCT GCT GTG GAG GGC | GAT GTT AGC TTC CGT TTA TCA
Leu Thr Thr Pro Ala Val Glu Gly | Asp Val Ser Phe Arg Leu Ser




Figure 13 
SspI(4402)



NaeI(i30)




SspI(441)
PvuII(532)




KS+/TVCVK#23 

4516 bp 



SacII(664)

SacI(657)

BstXI(665)

NotI(670)

XbaI(677)

SpeI(683)

BamHI(689)

SmaI(697)

PstI(705)

EcoRI(707)

EcoRV blunt(715)



PvuII(2529)



HindIII(227j
SalI(2286)
XhoI(2292)

KpnI(2311)



EcoRV bit 



Figure 14 






NarI 10596
Kas I 10596




SgfI 666S

Pac I 5757
Bsp120 I 5455
Apa I 5455



Figure 15 
NarI 9499



Kas I 9499 256 Sma I

Ehe I 9499 256 Xma I

Bbe I 9499 308 Bsm I




Pml I 5820
XhoI 5769
PaeR7I 5789
Eag I 5763
Not I 5762



Nco I S4S9
Bsp120 I 5455
Apa I 5455



Figure 16 



220 Dra III 
223 BsaA I
326 Nae I



Xmn I 3515
Aha II 3456
Sca I 3398



Fsp I 3140

Bsa I 2987
Ahd I 2915




653 BstXI 
656 Sac II 

664 Not I 

665 Eag I 
672 Xba I 
678 Spe I 
691 Sma I
691 Xma I

809 BstS I
827 Hinc II
627 Sal I
670 Mun I



982 Bst1107 I
982 Sna I
982 Xca I
1108 Mlu I
1154 BsrG I
1201 Nae I
12C8 BspE I
1261 Msc I
1309 Hind III 



AlwN I 2438 



1437 Sph I
1439 Nsi I
1522 Afl II
1584 Psi I
1629 Acc65 I 
1629 Asp718 
1629 Kpn I 



Figure 17 
Bsp120I 5455
Apa I 5455 



Figure 18 
Bguio




Figure 19 
Nucleotide sequence alignment of 740 AT #120 to human ADP-ribosylation factor

(ARF3) M33384 



740 AT #120 JUkG2UlG<^CaTGCG2^ 

II lllllll Mill II IIIIHII II II Mill II II 11111111 II) I 

M33384 AAGAAGGAGATGCGC^TCCTGATGGTGGGeCTGGATGCeGCAGGAAAGACCACCM

KKEMRILMVGLDAAGKTTIL 

740 AT #120 mCAAGCTCAAGCTCGGAGaGATTGTG&CC&CCATCC

lllillll II II II lllll llllllllllllllll! Mill MMIMIMI 

M33384 TACAAGCTGAAACTGGGGGAGATCGTCACCIACCATCCCTACGATTGGGTTCA

YKLKLGEIVTTIPTIGFNVE

740 AT #120 ACTGTGGAATACAAGAACATTAGTT'PCACCGTGTGGGATGTGGGGGGTCAGGACAAGATC 

1 1 Mill II IIMIill II II II IMIMMIM II II IMIIIMIM 

M33384 " ACACTGGAGf A«C AAGAAC&TCAGCTCTACAGTGtPGGGATGTGGOT 

TVEYKNISFTVWDVGGQDKI

740 AT #120 CGTCCCTTCKPG— AGACACTACTTCCAGAACACTG 

1 1 111 I 1 1 M Mill IIMMIIIMM lllll I II Mill II II Ml 

M33384 CGACCCCTCl'GGAGACACTACTTCCAGAACACCCAAGGGCT

RPLWRHYFQNTQGLIFVVDS

740 AT #120 AATGACAGaGACAGAGTTGTTGAGGCTCGAGATCAACTCC^C^ 

Mill I 1 1 MM 1H11I M II II M 1 1 11 II 11 MIMI 

M33384 AATGATCGGGAGCGAGTAAATGAGGCCCGGGAAGAGCTGATGAGAATGCTGGCGGAGGAC 
NDRERVNEAREELMRMLAED



740 AT #120 GAGC!TGCGTCATGGTGtPGTTGGTTOTOT!WfGCC^ 

Mill II I M 11 M I I Mill Mill MM) II Mill M III I II I II 

M33384 GAGCTCCGGG&TCCTCTACTCCOTOTC^^

ELRDAVLLVFANKQDLPNAM

740 AT #120 AACGCTGCTGAAATCACAGATAAGGTTGGCCTTCACTCCCTCCGTGAGCGTCATTGGTAT 

I M 11 M M 1 1 lllillll Mill Mill M Mill Mill III 1 Mill 

M33384 AACGCTGCTGA<^TCACAGAC^GCTGGGCCTGGATTCCCTTCGTCAGCGTAACTGGTAC 
NAAEITDKLGLHSLRHRNWY 

740 AT #120 ATCCAGAGCAGATGTGCCACTTCAGGTGAAGGGCTTTATGAAGGTCTGGACTGGCT 

1 1 Ml Ml lllillll II II lllll II lllll MMIMIMI 

M33384 ATTCAGGCCACCTGTGCCACCAGCGGGGRCGGGCTGTACGAAGGCCTGGACTGGCT 
IQATCATSGDGLYEGLDWL 



Figure 20 
KpnI(20i4)

PstI(2062)



pUC19 



Figure 21 
Nucleotide sequence alignment of 740 AT #88 to L33574
mRNA for rhodopsin



OUlCCTCCCGTTGGTGTTCCTCCTCCTCaAGGTTATCCACCGGAGGGATATCCAAAAGAT 

IIIIIIIIIIJ I II II Mill II ill III II II Mil HUH) III Hi II IN 

CAACCTCCCGTCGOTGTTCCTCCTCCTCAA^^ 

GCOTATCC^CC&CAAGG&TATCCTCCTCAGG<STATCGTCAGCAAG GCTATCCACCTCA 

mi mi i in iiiiiiiiiiiiiiiiiiiimin it ii in in iiimiii 

GCTTATCCAGCGCAAGGATATCCTCCTCAGGGCTATCCGCCGCAAGTACCCTCCACAGCA 
GGGATATCCTCAACAAGGTTATCCTCAGCAAGGATATCC 

iiiiniii i inn ii mi inn iiiii 

GGGATATCCGCCGCAAGG TACCCTCCACAAGGCTATCC 

Identities = 45/57 (78%), Positives = 45/57 (78%)



Figure 22 



740 AT #88 
L33574 
740 AT #88 
L33574 
740 AT #88 
L33574 
Nucleotide sequence alignment of 740 AT #88 to X07797
Octopus mRNA for rhodopsin



Identities = 45/57 (78%), Positives = 45/57 (78%) 



740 AT #88 CCAGCACAiH^TATC^ 

iiitiiniii M ii ii ii it ii ii i mum inn mm 

X07797 CC&<XWC&!&G^




Figure 23 
G-protein-coupled receptor

N. benthamiana 

740 AT #88 YPPQ-GYPPQGYPQQGYPPQGYPQQGYPQQGYPPPYAPQYPPPPQASATTEQVIA 

III I I I I III llll II I I I I llllll I I II 

ATTS2938 YPPKDGYPPAGYPPAGYPPPGY-AQGYPECGYPPPQYSQAPEEKQNAGMLEGCLA 



Figure 24 
G-protein-coupled receptor

N. benthamiana 

740 AT #88 PPVGV-PPPQGYPPEGYPKDAYPPQGYPPQGYPQQGYPPQGYPQQGYPQQGY 

II I MIIIIIMII llllllllll I I Ml Ml I II 

octopus rhodopsin PPQGAYPPPQGYPPQGYPPQGYPPQGYPPQGAPPQVEAPQGAPPQGVDNQAY 
Identity = 34/52 (65%), Positives = 35/52 (66%)



Figure 25 
Amino acid sequence comparison of 740 AT #2441 to tobacco RAN-B1 GTP binding protein



Nt RAN-B1 MALPNQQTVDYPSFKLVIVGDGGTGKTTFVKRHLTGEFEKKYEPTIGVEVHPLDFFTNCG 

I ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i I ! M I ! 1 1 1 1 1 1 1 1 ) I ! 1 1 1 1 1 1 I [ J I ! ! 1 1 1 [ I E ! 1 ! { 1 1 1 

740 AT #2441 MALPNQQTVDYPSFKLVIVGDGGTGKTTFVKRHLTGEFEKKYEPTIGVEVHPLDFFTNCG 
Nt RAN-B1 KIRFYCWDTAGQEKFGGLRDGYYIHGQCAIIMFDVTSTTDIQECSNMAP*SLQGL*KHSQ 

IMIIIII MIIIIIIIMMIIII1IIIII1III+ + I 

740 AT #2441 KIRFYCWDTAGQEKFGGLRDGYYIHGQCAI IMFDVTARLTYKNVPTWHR-DLCRVCENI P 

Nt RAN-B1 LFFVGI KLM*KNRQVKAQ 

+ I 1+ 1111111+ 
740 AT #2441 IVLCGNKVDVKNRQVKAK 



Figure 26 
Nucleotide sequence comparison of 740 AT #2441 to human RAN GTP-binding protein



Human RAN ACTGGAGAGTTTGAGAAGAAGTATGAACCCACTATTGGTGTTGAGGTTCATCC 

Mill M llllllll II I 1 1 1 1 I 1 1 1 E 1 1 1 1 1 f 1 1 1 1 [ I ] 

740 AT #2441 ACTGGTGAATTTGAGAAGAAGTATGTAGCCACCTTGGGTGTTGAGGTTCATCC
Identities = 46/53 (86%) , Positives = 46/53 (86%) 



Figure 27 
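The identity figure reported for this alignment can be reproduced by counting matching positions. This is a minimal sketch for an ungapped alignment of equal-length sequences; the function name `identity` and the integer (rounded-down) percentage are assumptions chosen to match the figure's "46/53 (86%)" notation, not a description of the alignment program that produced the drawing.

```python
def identity(a, b):
    """Count identical positions in two aligned, equal-length sequences.

    Returns (matches, aligned_length, percent), where percent is rounded
    down to a whole number. No gap handling is attempted.
    """
    if len(a) != len(b):
        raise ValueError("sequences must have the same aligned length")
    matches = sum(x == y for x, y in zip(a, b))
    return matches, len(a), 100 * matches // len(a)


# The two rows of the Figure 27 alignment as printed.
human_ran = "ACTGGAGAGTTTGAGAAGAAGTATGAACCCACTATTGGTGTTGAGGTTCATCC"
clone_2441 = "ACTGGTGAATTTGAGAAGAAGTATGTAGCCACCTTGGGTGTTGAGGTTCATCC"

matches, length, percent = identity(human_ran, clone_2441)
print(f"Identities = {matches}/{length} ({percent}%)")
# Identities = 46/53 (86%)
```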
Figure 28



